Neiva
Investigation of the Privacy Concerns in AI Systems for Young Digital Citizens: A Comparative Stakeholder Analysis
Campbell, Molly, Barthwal, Ankur, Joshi, Sandhya, Shouli, Austin, Shrestha, Ajay Kumar
The integration of Artificial Intelligence (AI) systems into technologies used by young digital citizens raises significant privacy concerns. This study investigates these concerns through a comparative analysis of stakeholder perspectives. A total of 252 participants were surveyed, with the analysis focusing on 110 valid responses from parents/educators and 100 from AI professionals after data cleaning. Quantitative methods, including descriptive statistics and Partial Least Squares Structural Equation Modeling, examined five validated constructs: Data Ownership and Control, Parental Data Sharing, Perceived Risks and Benefits, Transparency and Trust, and Education and Awareness. Results showed Education and Awareness significantly influenced data ownership and risk assessment, while Data Ownership and Control strongly impacted Transparency and Trust. Transparency and Trust, along with Perceived Risks and Benefits, showed minimal influence on Parental Data Sharing, suggesting other factors may play a larger role. The study underscores the need for user-centric privacy controls, tailored transparency strategies, and targeted educational initiatives. Incorporating diverse stakeholder perspectives offers actionable insights into ethical AI design and governance, balancing innovation with robust privacy protections to foster trust in a digital age.
- North America > Canada > British Columbia > Vancouver Island > Regional District of Nanaimo > Nanaimo (0.15)
- South America > Colombia > Huila Department > Neiva (0.04)
- North America > United States > California > Ventura County > Thousand Oaks (0.04)
- North America > Canada > Saskatchewan (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Information Technology > Security & Privacy (1.00)
- Education > Educational Setting > K-12 Education (0.68)
- Information Technology > Security & Privacy (1.00)
- Information Technology > Communications > Social Media (0.93)
- Information Technology > Data Science > Data Mining (0.73)
- (4 more...)
Evidential Deep Learning for Probabilistic Modelling of Extreme Storm Events
Khot, Ayush, Luo, Xihaier, Kagawa, Ai, Yoo, Shinjae
Uncertainty quantification (UQ) methods play an important role in reducing errors in weather forecasting. Conventional approaches in UQ for weather forecasting rely on generating an ensemble of forecasts from physics-based simulations to estimate the uncertainty. However, it is computationally expensive to generate many forecasts to predict real-time extreme weather events. Evidential Deep Learning (EDL) is an uncertainty-aware deep learning approach designed to provide confidence about its predictions using only one forecast. It treats learning as an evidence acquisition process where more evidence is interpreted as increased predictive confidence. We apply EDL to storm forecasting using real-world weather datasets and compare its performance with traditional methods. Our findings indicate that EDL not only reduces computational overhead but also enhances predictive uncertainty. This method opens up novel opportunities in research areas such as climate risk assessment, where quantifying the uncertainty about future climate is crucial.
- North America > United States > Illinois (0.04)
- Asia > Singapore (0.04)
- South America > Colombia > Huila Department > Neiva (0.04)
- (2 more...)
- Energy (0.94)
- Government > Regional Government > North America Government > United States Government (0.46)
SoccerGuard: Investigating Injury Risk Factors for Professional Soccer Players with Machine Learning
Bartels, Finn, Xing, Lu, Midoglu, Cise, Boeker, Matthias, Kirsten, Toralf, Halvorsen, Pål
We present SoccerGuard, a novel framework for predicting injuries in women's soccer using Machine Learning (ML). This framework can ingest data from multiple sources, including subjective wellness and training load reports from players, objective GPS sensor measurements, third-party player statistics, and injury reports verified by medical personnel. We experiment with a number of different settings related to synthetic data generation, input and output window sizes, and ML models for prediction. Our results show that, given the right configurations and feature combinations, injury event prediction can be undertaken with considerable accuracy. The optimal results are achieved when input windows are reduced and larger combined output windows are defined, in combination with an ideally balanced data set. The framework also includes a dashboard with a user-friendly Graphical User Interface (GUI) to support interactive analysis and visualization.
- North America (0.14)
- Europe > Norway > Eastern Norway > Oslo (0.05)
- Europe > Germany > Saxony > Leipzig (0.04)
- (2 more...)
- Leisure & Entertainment > Sports > Soccer (1.00)
- Health & Medicine (1.00)
Enhancing stop location detection for incomplete urban mobility datasets
Bertè, Margherita, Ibrahimli, Rashid, Koopmans, Lars, Valgañón, Pablo, Zomer, Nicola, Colombi, Davide
Stop location detection, within human mobility studies, has an impacts in multiple fields including urban planning, transport network design, epidemiological modeling, and socio-economic segregation analysis. However, it remains a challenging task because classical density clustering algorithms often struggle with noisy or incomplete GPS datasets. This study investigates the application of classification algorithms to enhance density-based methods for stop identification. Our approach incorporates multiple features, including individual routine behavior across various time scales and local characteristics of individual GPS points. The dataset comprises privacy-preserving and anonymized GPS points previously labeled as stops by a sequence-oriented, density-dependent algorithm. We simulated data gaps by removing point density from select stops to assess performance under sparse data conditions. The model classifies individual GPS points within trajectories as potential stops or non-stops. Given the highly imbalanced nature of the dataset, we prioritized recall over precision in performance evaluation. Results indicate that this method detects most stops, even in the presence of spatio-temporal gaps and that points classified as false positives often correspond to recurring locations for devices, typically near previous stops. While this research contributes to mobility analysis techniques, significant challenges persist. The lack of ground truth data limits definitive conclusions about the algorithm's accuracy. Further research is needed to validate the method across diverse datasets and to incorporate collective behavior inputs.
- Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.05)
- Europe > Spain > Galicia > Madrid (0.05)
- North America > United States > New York (0.04)
- (7 more...)
- Telecommunications (0.68)
- Information Technology (0.68)
- Health & Medicine (0.49)
- Information Technology > Communications (1.00)
- Information Technology > Data Science > Data Mining > Big Data (0.67)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.52)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.49)
A Masked Semi-Supervised Learning Approach for Otago Micro Labels Recognition
Shang, Meng, Dedeyne, Lenore, Dupont, Jolan, Vercauteren, Laura, Amini, Nadjia, Lapauw, Laurence, Gielen, Evelien, Verschueren, Sabine, Varon, Carolina, De Raedt, Walter, Vanrumste, Bart
The Otago Exercise Program (OEP) serves as a vital rehabilitation initiative for older adults, aiming to enhance their strength and balance, and consequently prevent falls. While Human Activity Recognition (HAR) systems have been widely employed in recognizing the activities of individuals, existing systems focus on the duration of macro activities (i.e. a sequence of repetitions of the same exercise), neglecting the ability to discern micro activities (i.e. the individual repetitions of the exercises), in the case of OEP. This study presents a novel semi-supervised machine learning approach aimed at bridging this gap in recognizing the micro activities of OEP. To manage the limited dataset size, our model utilizes a Transformer encoder for feature extraction, subsequently classified by a Temporal Convolutional Network (TCN). Simultaneously, the Transformer encoder is employed for masked unsupervised learning to reconstruct input signals. Results indicate that the masked unsupervised learning task enhances the performance of the supervised learning (classification task), as evidenced by f1-scores surpassing the clinically applicable threshold of 0.8. From the micro activities, two clinically relevant outcomes emerge: counting the number of repetitions of each exercise and calculating the velocity during chair rising. These outcomes enable the automatic monitoring of exercise intensity and difficulty in the daily lives of older adults.
- Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.05)
- South America > Colombia > Huila Department > Neiva (0.04)
- Europe > Sweden > Stockholm > Stockholm (0.04)
- (2 more...)
- Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.85)
An Active Learning Framework with a Class Balancing Strategy for Time Series Classification
Training machine learning models for classification tasks often requires labeling numerous samples, which is costly and time-consuming, especially in time series analysis. This research investigates Active Learning (AL) strategies to reduce the amount of labeled data needed for effective time series classification. Traditional AL techniques cannot control the selection of instances per class for labeling, leading to potential bias in classification performance and instance selection, particularly in imbalanced time series datasets. To address this, we propose a novel class-balancing instance selection algorithm integrated with standard AL strategies. Our approach aims to select more instances from classes with fewer labeled examples, thereby addressing imbalance in time series datasets. We demonstrate the effectiveness of our AL framework in selecting informative data samples for two distinct domains of tactile texture recognition and industrial fault detection. In robotics, our method achieves high-performance texture categorization while significantly reducing labeled training data requirements to 70%. We also evaluate the impact of different sliding window time intervals on robotic texture classification using AL strategies. In synthetic fiber manufacturing, we adapt AL techniques to address the challenge of fault classification, aiming to minimize data annotation cost and time for industries. We also address real-life class imbalances in the multiclass industrial anomalous dataset using our class-balancing instance algorithm integrated with AL strategies. Overall, this thesis highlights the potential of our AL framework across these two distinct domains.
- Europe > Switzerland > Zürich > Zürich (0.14)
- South America > Colombia > Huila Department > Neiva (0.04)
- North America > United States > Wisconsin > Dane County > Madison (0.04)
- (4 more...)
- Workflow (1.00)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- (2 more...)
- Health & Medicine (1.00)
- Energy (0.67)
- Banking & Finance (0.67)
fakenewsbr: A Fake News Detection Platform for Brazilian Portuguese
Giordani, Luiz, Darú, Gilsiley, Queiroz, Rhenan, Buzinaro, Vitor, Neiva, Davi Keglevich, Guzmán, Daniel Camilo Fuentes, Henriques, Marcos Jardel, Junior, Oilson Alberto Gonzatto, Louzada, Francisco
The proliferation of fake news has become a significant concern in recent times due to its potential to spread misinformation and manipulate public opinion. This paper presents a comprehensive study on detecting fake news in Brazilian Portuguese, focusing on journalistic-type news. We propose a machine learning-based approach that leverages natural language processing techniques, including TF-IDF and Word2Vec, to extract features from textual data. We evaluate the performance of various classification algorithms, such as logistic regression, support vector machine, random forest, AdaBoost, and LightGBM, on a dataset containing both true and fake news articles. The proposed approach achieves high accuracy and F1-Score, demonstrating its effectiveness in identifying fake news. Additionally, we developed a user-friendly web platform, fakenewsbr.com, to facilitate the verification of news articles' veracity. Our platform provides real-time analysis, allowing users to assess the likelihood of fake news articles. Through empirical analysis and comparative studies, we demonstrate the potential of our approach to contribute to the fight against the spread of fake news and promote more informed media consumption.
- South America > Brazil > São Paulo (0.05)
- South America > Colombia > Huila Department > Neiva (0.04)
- Europe > Ireland (0.04)
- Europe > Czechia > South Moravian Region > Brno (0.04)
Re-imagining health and well-being in low resource African settings using an augmented AI system and a 3D digital twin
Moodley, Deshendran, Seebregts, Christopher
This paper discusses and explores the potential and relevance of recent developments in artificial intelligence (AI) and digital twins for health and well-being in low-resource African countries. We use the case of public health emergency response to disease outbreaks and epidemic control. There is potential to take advantage of the increasing availability of data and digitization to develop advanced AI methods for analysis and prediction. Using an AI systems perspective, we review emerging trends in AI systems and digital twins and propose an initial augmented AI system architecture to illustrate how an AI system can work with a 3D digital twin to address public health goals. We highlight scientific knowledge discovery, continual learning, pragmatic interoperability, and interactive explanation and decision-making as essential research challenges for AI systems and digital twins.
- Africa > South Africa > Western Cape > Cape Town (0.05)
- Africa > Sub-Saharan Africa (0.04)
- Africa > Uganda (0.04)
- (6 more...)
- Research Report (1.00)
- Overview (1.00)
Exploring ordered patterns in the adjacency matrix for improving machine learning on complex networks
Neiva, Mariane B., Bruno, Odemir M.
The use of complex networks as a modern approach to understanding the world and its dynamics is well-established in literature. The adjacency matrix, which provides a one-to-one representation of a complex network, can also yield several metrics of the graph. However, it is not always clear that this representation is unique, as the permutation of lines and rows in the matrix can represent the same graph. To address this issue, the proposed methodology employs a sorting algorithm to rearrange the elements of the adjacency matrix of a complex graph in a specific order. The resulting sorted adjacency matrix is then used as input for feature extraction and machine learning algorithms to classify the networks. The results indicate that the proposed methodology outperforms previous literature results on synthetic and real-world data.
- South America > Colombia > Huila Department > Neiva (0.04)
- South America > Brazil > São Paulo (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
A Network Classification Method based on Density Time Evolution Patterns Extracted from Network Automata
Zielinski, Kallil M. C., Ribas, Lucas C., Machicao, Jeaneth, Bruno, Odemir M.
Network modeling has proven to be an efficient tool for many interdisciplinary areas, including social, biological, transport, and many other real world complex systems. In addition, cellular automata (CA) are a formalism that has been studied in the last decades as a model for exploring patterns in the dynamic spatio-temporal behavior of these systems based on local rules. Some studies explore the use of cellular automata to analyze the dynamic behavior of networks, denominating them as network automata (NA). Recently, NA proved to be efficient for network classification, since it uses a time-evolution pattern (TEP) for the feature extraction. However, the TEPs explored by previous studies are composed of binary values, which does not represent detailed information on the network analyzed. Therefore, in this paper, we propose alternate sources of information to use as descriptor for the classification task, which we denominate as density time-evolution pattern (D-TEP) and state density time-evolution pattern (SD-TEP). We explore the density of alive neighbors of each node, which is a continuous value, and compute feature vectors based on histograms of the TEPs. Our results show a significant improvement compared to previous studies at five synthetic network databases and also seven real world databases. Our proposed method demonstrates not only a good approach for pattern recognition in networks, but also shows great potential for other kinds of data, such as images.
- South America > Brazil > São Paulo (0.05)
- Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)
- South America > Colombia > Huila Department > Neiva (0.04)
- (4 more...)